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(ii) Rejections Under 35 U.S.C. §102. Claims 1, 6-8 and 26-30 remain rejected as al- 
legedly anticipated by Aghajari et al., Protein Science, 1998, 7:564-572 ("Aghajari"). The rejec- 
tion is traversed on the grounds that Aghajari does not disclose the instantly claimed methods. 

In first response to the rejection, the claims have been amended to be directed to meth- 
ods that call for " utilizing the calculated dimensions, chemical and/or electrostatic properties ob- 
tained in step b for steps b and d to identify compounds that contain functional groups that can be 
accommodated by said cavity." See step "c" in claims 1, 26, 27, 29 and 30 and step "d" of claim 
28. Aghajari merely defines the coordinates of a chloride ion that is already present in the allos- 
teric site of the cc-amylase enzyme. Thus, Aghajari fails to utilize any information concerning the 
physical properties of a cleft to identify compounds that contain functional groups that can be ac- 
commodated by a cavity. For at least this reason, Aghajari does not anticipate the instant claims. 

On at a more basic level, the disclosure of Aghajari is limited to the description of the 
binding of a chloride ion on an a-amylase enzyme. A chloride ion is not a "compound." Neither 
does a chloride ion contain a "functional group." Accordingly, Aghajari fails to identify a com- 
pound that contains functional groups that can be accommodated by a cavity on a protein. For at 
least this reason, additionally, Aghajari does not disclose the instantly claimed methods. 

Finally, Aghajari does not anticipate the instant claims because Aghajari does not ex- 
plicitly or inherently disclose a "method of identifying a compound that modulates interaction be- 
tween a functionally critical site of a target protein and a modifier" (as recited in the preamble of 
e.g., claim 1) comprising the steps set forth in the claims and "thereby identifying said compound 
that modulates intermolecular interaction at said functionally critical site of said target protein and 
said modifier," as recited at the end of each of the independent claims (emphasis added). Agha- 
jari merely describes the crystal structure of a-amylase with a bound activating chloride ion. 
Aghajari does not use the crystal structure to identify the chloride ion as a modulator of a-amylase 
function. Indeed, this activity was previously known. Thus, by no fair interpretation of the 
phrases "method of identifying a compound" and "thereby identifying said compound" that are 
recited respectively in the preamble and wherefore-clause of the claims can Aghajari be consid- 
ered to anticipate the instant claims 
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For at least all of the reasons set forth above, Aghajari does not anticipate the instantly 
claimed methods. Reconsideration of the instant claims and withdrawal of all rejections thereof as 
anticipated by Aghajari is request, accordingly. 

CONCLUSION 

This application is believed to be in condition for allowance, which is earnestly solic- 
ited. 

if there are any other issues remaining which the Examiner believes could be resolved 
through either a Supplemental Response or an Examiner's Amendment, the Examiner is respect- 
fully requested to contact Applicant's representative at the telephone number indicated below. 



Dated: September 25, 2006 Respectfully submitted, 

Mitchell Bernstein, Ph. D . 

Registration No.: 46,550 
DARBY & DARBY P.C. 
P.O. Box 5257 

New York, New York 10150-5257 
(212) 527-7700 
(212) 527-7701 (Fax) 
Attorneys/Agents For Applicant 
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Molecular Dynamics Simulation of E. coli 
Ribonuclease Hi in Solution: Correlation with NMR 
and X-ray Data and Insights into Biological Function 

Marios Philippopoulos 1 and Carmay Lim 2 * 



A 500 ps molecular dynamics simulation of Escherichia coli RNase Hi in the 
presence of explicit water molecules has been carried out to aid in the 
interpretation of NMR N-H backbone model free parameters and X-ray 
B-factor values of the free enzyme. Both experimental techniques have 
revealed unusual structural and dynamic features of the protein. Atomic 
fluctuations (J3-factors) and re-orientational motions of the backbone 
heteronuclear bonds (order parameters) computed from the simulation are 
compared with results obtained from experiments. Qualitative agreement 
is obtained between the computed and X-ray E-factors, whereas the 
agreement between the computed and NMR generalized order parameters 
is as good as quantitative for most residues. Reasons for significant 
discrepancies, the physical basis and the plausible biological consequences 
of the observed protein dynamics are discussed. 
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Introduction 

Ribonuclease H (RNase H) is an endonuclease 
that specifically cleaves the RNA strand of a 
DNARNA hybrid, yielding a 3'-hydroxyl and a 
5'-phosphate. Experimental data suggest that (1) 
RNase H of cellular origin is involved in DNA 
replication or repair. (2) RNase H of retroviral 
origin is required to reverse-transcribe the viral 
single-stranded RNA genome into double-stranded 
DNA (Crouch, 1990). RNase H cleavages at the 
origin of plus strand DNA synthesis are species 
specific (Luo et al., 1990). Because of its crucial role 
in infection by retroviruses, RNase H is a potential 
target for drugs against retroviral diseases, like HIV 
infections and AIDS. 

Among the family of RNase H domains, 
Escherichia coli RNase Hi is the most extensively 
studied. It consists of 155 amino acid residues 
containing five a helices and five P strands, as well 
as five reverse turns (Katayanagi et al., 1992), three 
of which are part of two loops and the C-terminal 
region. The secondary structure elements are listed 



Abbreviations used: MD, molecular dynamics; rmsd, 
root-mean-square deviation; NOE, nuclear Overhauser 
enhancement; IMAC, immobilized-metal-affinity- 
chromatography; PDB, protein data base. 



in Table 1 according to the definitions employed by 
Yang et al. (1990; see Figure 1). The E. coli sequence 
numbering is employed in this paper. RNase Hi has 
an isoelectric point of 9.0 (Kanaya et al., 1989) and 
an optimal pH of 8.0 for enzymatic activity (Oda 
et al., 1994). There are seven highly conserved 
residues in the enzyme (Yang et al. t 1990); AsplO, 
Glu48, Asp70, Ser71, Hisl24, Asnl30 and Aspl34 
(see Figure 1). The first three form the catalytic 
active site. The residues responsible for substrate 
binding and catalytic reaction have been identified 
from the following. (1) Site-directed mutagenesis 
(Kanaya et al., 1990, 1991a,b) and chemical modi- 
fication studies (Hogrefe et ai, 1990; Uchiyama 
et al, 1994). (2) X-ray structures of E. coli RNase Hi 
solved by multi-wavelength anomalous diffraction 
to 2.0 A (Yang et al, 1990; protein data base (PDB) 
entry 1RNH) and multiple isomorphous replace- 
ment to 1.48 A (Katayanagi et al, 1992; PDB entry 
2RN2), a 2.8 A X-ray structure of E. coli RNase Hi 
complexed with a metal ion (Katayanagi et al., 
1993a), crystal structures of active site mutants of 
E. coli RNase Hi (Katayanagi et al., 1993b), and 
model structures of the enzyme-substrate (Yang 
et al., 1990; Nakamura et ai., 1991; Katayanagi et a/., 
1992). (3) NMR studies of nucleic acids and metal 
ions binding to E. coli RNase Hi (Nakamura et al, 
1991; Oda et al, 1991, 1993b, 1994; Huang et al, 
1994). pK a values of histidines (Oda et al, 1993a) 
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and carboxyl groups (Oda et al. f 1994); and 
backbone dynamics of the free enzyme (Mandel 
et ai., 1995). The results from these studies are 
summarized below. 

Binding/specificity 

The loops between Pi and 02, between otc and clq 
(the "handle" region), and between % and ole have 
been implicated in substrate binding (Mandel et ai., 
1995). The crystal structures of Mg 2+ -free (Yang 
et ai., 1990; Katayanagi et ai., 1992) and Mg 2+ -bound 
(Katayanagi et ai., 1993a) RNase Hi indicate; (1) at 
least one metal cation binding site involving the 
side-chains of Asn44 and Glu48. (2) A phosphate 
binding site roughly 13.7 A from the catalytic site 
at the NH 2 termini of a A and ol d . Site-directed 
mutagenesis and NMR titration experiments indi- 
cate that Cysl3, Asnl6, Asn44 ( Asn45 and Gln72 are 
engaged in hybrid binding (Nakamura et ai., 1991), 
in addition to the active site acidic residues, AsplO, 
Glu48 and Asp70 (Kanaya etai., 1990). Furthermore, 
the NMR-observed enhanced backbone dynamics 
of residues 89 to 99 in the handle region, Glyl5 
and Asnl6 in the p,-p 2 loop and Glyl23, Alal25 
and Glyl26 in the Ps-oce loop suggest a role 
in conforming the substrate-binding cleft to the 
substrate surface and positioning catalytic residues 
(Mandel et ai., 1995). 

Catalysis 

The active site residues AsplO, Glu48 and Asp70 
are crucial for activity; amidation of any of these 
carboxylates (Asp Asn and Glu -> Gin) abolished 
activity, and single methylene shifts in side-chain 

Table 1. Average B- factors for secondary structure 



elements 


Secondary 


Residue 




B-factor (A2) 




structure 


range 


MD 


2RN2 


1RNH 


PI 


4-13 


5.5 


14.8 


12.3 


P2 


18-27 


5.2 


13.9 


10,1 


P3 


32-42 


6.6 


15.8 


13.4 


P4 


64-69 


6.0 


14.4 


13.5 


P5 


115-120 


8.1 


16.8 


16.5 


oeA 


43-58 


5.8 


12.5 


6.7 


aB 


71-80 


7.2 


16.6 


8.2 


aC 


81-88 


10.3 


25.2 


9,8 


aD 


100-112 


6.7 


13.5 


9.9 


aE 


127-142 


6.7 


17.4 


15.0 


Turn i2 


14-17 


10.2 


23.3 


18.3 


Turr»23 


28-31 


25.2 


31.0 


34.7 


L00pA4 


59-63 


15.8 


23.0 


26.7 


HandlecD 


89-99 


22.7 a 


18.0 


9.0 


LoopsE 


121-126 


17.9 


28.0 


25.9 


N Terminus 


1-3 


109.7 8 


28.7 


40.5 b 


C Terminus 


143-155 


30.1" 


28.9 


16.6 b 



MD denotes B- factor values calculated from the 400 ps 
production trajectory; 2RN2 and 1RNH refer to B-factor values 
taken from Katayanagi et ai (1992) and Yang et al (1990), 
respectively 

' Certain residues in this element exhibit unrealistically high 
fluctuations, probably due to inadequate solvation. 

b Residues 1 and 153 to 155 are disordered in the 1RNH 
structure. 




Figure 1. Ribbon diagram of the backbone trace of 
E. coli RNase Hi; conserved residues and regions involved 
in substrate binding are colored red and yellow, 
respectively 



length (Asp -» Glu and Glu -> Asp) reduced Ac at to 
1 to 8% that of the wild-type without a change in 
K m (Kanaya et ai., 1990). Although Asp 134 is not 
crucial for activity (as the D134H and D134N 
mutant proteins retained considerable activity), it 
has been suggested that it takes part in catalysis but 
not in binding, since the replacement of Asp 134 
by residues other than asparagine and histidine 
dramatically decreased the enzymatic activity 
without seriously affecting substrate binding and 
tertiary structure (Haruki et ai., 1994). His 124 is also 
involved in the catalytic function, as its mutation 
to Lys, Gin or Glu dramatically lowered the fcat 
value, without significantly affecting the K m value, 
as compared to those of the native enzyme (Oda 
et a/., 1993a). Two mechanisms have been proposed 
for the catalytic function of the enzyme. (1) A 
single-metal-ion carboxylate-hydroxyl relay mech- 
anism (Nakamura et a/., 1991). (2) A two-metal- 
ion mechanism first suggested for E. coli DNA 
polymerase I (Yang et a/., 1990; Beese & Steitz, 1991). 
The latter is supported by the finding that Mn z+ 
binds at two sites in the RNase H domain of HIV 
reverse transcriptase (Davies et ai, 1991), whereas 
the former is supported by the identification of a 
single divalent cation binding site in E. coli RNase 
Hi by X-ray (Katayanagi et al, 1993a), NMR (Oda 
et a/., 1991), chemical modification and kinetic 
studies (Uchiyama et ai., 1994). 

As both NMR spectroscopy and X-ray crystallog- 
raphy have revealed unusual structural and 
dynamical features of E. coli RNase Hi, we have 
carried out a 500 ps molecular dynamics (MD) 
simulation of the enzyme in the presence of explicit 
water molecules. There are five main objectives of 
our study First, to evaluate the ability of MD 
simulations to reproduce dynamic behaviour on the 
ps timescale. Second, to complement NMR spectro- 
scopic and X-ray crystallographic data, e.g. by 
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Figure 2. Plot of the root mean 
square deviation (RMSD) of the 
backbone from the 2RN2 crystal 
structure as a function of simulation 
time. 



determining order parameters of catalytically 
important residues which could not be quantified 
by NMR (Mandel et a/., 1995) and evaluating 
conformational changes of residues involved in 
crystal contacts or disordered in the X-ray structure. 
Third, to assess the degree of overlap between 
NMR spectroscopy and X-ray crystallography in 
describing protein mobility Fourth, to provide a 
physical basis for the protein dynamics observed 
by NMR spectroscopy (Mandel et al„ 1995) and 
X-ray crystallography (Yang et al., 1990; Katayanagi 
et al„ 1992). Fifth, to correlate the structural and 
dynamical features of certain residues/regions to 
biological function. Thus, both atomic fluctuations 
(B-factors) and re-orientational motions of heteronu- 
clear bonds (order parameters) computed from the 
simulation are compared with experiment. The 
reasons for significant discrepancies, the physical 
basis and the plausible functional consequences of 
the observed protein dynamics are presented in the 
Discussion section. 



Results 

A plot of the root-mean-square deviation (rmsd) 
of the backbone from the 2RN2 structure as a 
function of simulation time (Figure 2) shows that the 
rmsd initially rises but after 100 ps it fluctuates 
between 1.0 and 1.7 A. These values are consistent 
with those found in other protein simulations. Thus, 
the first 100 ps is included in the equilibration 
phase, whereas the remaining 100 to 500 ps part of 
the trajectory is employed in analysis. Configur- 
ations from every 0.1 ps of the production trajectory 
have been averaged to yield a mean structure. In 
computing the ^-factors (Brooks et al„ 1988), order 
parameters (Levy et ai., 1981, 1982) and effective 
internal correlation times (Philippopoulos & Lim, 
1994), overall translational and rotational diffusion 
have been removed by superimposing the backbone 
of each configuration in the 400 ps trajectory 
onto the backbone of the 2RN2 structure using 
a mass-weighted least-squares fitting algorithm 



(Kabsch, 1976). A list of hydrogen bonds involving 
conserved residues that are observed in the 
simulation is presented in Table 2, where Wat 
denotes a crystal water molecule whose numbering 
is taken from Katayanagi et al. (1992), and Bui 
denotes water overlaid in the simulation. A 
hydrogen bond is defined by a heavy-heavy and a 



Table 2. Hydrogen bonds of conserved residues (and the 
backbone carbonyl oxygen of the preceding residue) 
observed in the simulation 



Donor/ 


Donor/ 


DCray 


<r>MD 


rmsd 


Acceptor 


Acceptor 


(A) 


(A) 


(A) 


Thr9(0) 


Watl75(0) 


2.56 


2.71 


0.12 


AsplO(N) 


Gly23(0) 


3.02 


2.89 


0.14 


Asp 10(O) 


Gly23(N) 


2.98 


2.94 


0,16 


Aspl0(O S2 ) 


Bull 184(0) 




3.42 


2.02 


Met47(0) 


Ala51(N) 


3.08 


3.14 


0.21 


Glu48(N) 


Asn44(0) 


3,13 


2.97 


0.15 


Glu48(0) 


Bul3341(0) 




2.79 


0.16 


Glu48(0 £l ) 


Serf 1(0*) 


2.60 


2.71 


0.13 


Glu48(0 £l ) 


Bul4232(0) 




2.86 


0.59 


Glu48(0 2 ) 


Watl56(0) 


3.32 


2.83 


0.31 


Glu48(O c2 ) 


Bul3158(0) 




3.56 


1.70 


Thr69(0) 


Vall21(N) 


3.11 


3.08 


0.22 


Asp70(N) 


. Asp70(O 51 ) 


2,87 


2.87 


0.21 


Asp70(O 6 ') 


Watl66(0) 


2.82 


2.94 


0.50 


Asp70(O 81 ) 


Watl66(0) 


2.78 


2.93 


0.33 


Asp70(O 5 ') 


Watl75(0) 


3.39 


2.70 


0.15 


Asp70(O* 2 ) 


Watl66(0) 


2.95 


3.10 


0.38 


Asp70(O 52 ) 


Bul3 158(0) 




2.90 


0.57 


Ser71(N) 


Thr69(0*') 


3.19 


3.20 


0.14 


Ser71(0) 


Arg75{N) 


3.26 


3.33 


0.14 


Ser71(0 Y ') 


Val74(N) 


2.90 


3.22 


0.20 


Glul29(0) 


Cysl33(N) 


3.08 


2.91 


0.14 


Glu 129(0) 


Cysl33(S r ) 


3.42 


3.23 


0.16 


Asnl30(N) 


Hisl27(0) 


2.88 


3.12 


0.21 


Asnl30(N") 


Wat 175(0) 


3.26 


3.12 


0.28 


Cysl33(0) 


Alal37(N) 


2.84 


2.93 


0.13 


Aspl34(N) 


Asnl30(O) 


3.08 


3.03 


0.18 


Aspl34(0) 


Argl38(N) 


2.82 


3.00 


0.16 


Aspl34(0 51 ) 


Argl38(N c ) 


3.16 


3.23 


0.60 


Aspl34(0 6t ) 


Bull 184(0) 




3.27 


1,86 


Aspl34(0 82 ) 


Argl38(N c ) 


3.05 


3.22 


0.32 



The rxray distances are from the 2RN2 structure after 
hydrogens have been built in; <r>MD and rmsd are average 
distances from the production trajectory and their corresponding 
rmsd, respectively. 
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light-heavy donor-acceptor distance of ^ 3.5 A and 
s$2.5 A, respectively 



Structural analysis 

Most protein-protein charge-charge interactions 
(involving side-chains) observed in the 2RN2 
structure (Katayanagi et al, 1992) are preserved in 
the simulation. In particular, the hydrogen bond 
network involving Arg46, considered responsible 
for rigidifying the overall protein conformation, has 
remained intact. A number of interactions between 
the C terminus (starting at Asnl43) and residues 
from P3 and ocA are maintained in the simulation, 
as well as a crystal water molecule bridging the C 
terminus and |33, Aspl48(N)-Watl73-Arg41(N). 
These contacts serve to rigidify the C-terminal 
conformation (see also Discussion), 

However, there are some protein-protein charge- 
charge interactions that are in dynamic exchange or 
are not preserved; these involve residues directly 
participating or in the immediate vicinity of 
residues involved in inter-molecular contacts in the 
2RN2 structure (Katayanagi et al., 1992), Therefore, 
some of the non-preserved interactions appear to 
be a result of the local environment relaxing to 
energetically more favorable conformations during 
the simulation. For example, in the 2RN2 structure, 
the backbone and side-chain oxygens of Asp94 have 
inter-molecular contacts with neighbouring Argl32 
and Gln4 and this may have resulted in the 
disruption of the Asp94(0 52 )-Thr92(0 Yl ) hydrogen 
bond in the handle region during the simulation. 
Furthermore, the side-chains of Arg29, Arg31, 
Asp70 and Argl32 hydrogen bond with residues in 
a neighbouring molecule in the 2RN2 structure; the 
close proximity of Arg29, Arg31, Asp70 and Argl32 
to Arg27, Glu32, Ser68 and Asnl30, respectively, 
may have caused several interactions between the P 
sheet and ocE involving the latter residues to be 
unstable in the absence of crystal contacts: the 
Ser68(O 7l )-Asnl30(N 82 ) interaction is partly pre- 
served and two out of the four hydrogen bonds 
involving Arg27 in the 2RN2 structure are not stable 
in the simulation. 

Another factor that may affect some of the 
changes in the hydrogen bonding patterns relative 
to the X-ray structure is the different pH states of 
the X-ray structure (pH 9.0) and the simulated one 
(pH 5.5), resulting in different protonation states for 
AsplO and the histidines. For example, in the 2RN2 
structure, AsplO(0 52 ) is found to be 2.63 and 3.23 A 
away from the backbone nitrogen and oxygen of 
Glyll, respectively; the proximity of an anionic 
Aspl0(O 82 ) to the carbonyl oxygen of Glyll is, 
however, surprising and may be due to an 
inter-molecular non-bonded contact with neigh- 
bouring Lys87(N c ). In the dynamics-averaged 
structure, the neutral side-chain atom of AsplO(H 62 ) 
hydrogen bonds instead to a water molecule. As 
another example, the reported intramolecular 
interactions in the 2RN2 structure between N 51 and 



N c2 of His 1 14 and the backbone carbonyl oxygens of 
AlallO and Glu61, respectively is in apparent 
contradiction to the singly protonated state of 
Hisll4 expected under the crystallization con- 
ditions (pH 9) and its measured pK a value (piC a < 5). 
These two hydrogen bonds were not found 
when hydrogen atoms were built into the 2RN2 
structure and they are not seen in the aver- 
age structure, as deprotonated Hisll4(N 51 ) and pro- 
tonated Hisll4(N e2 ) are hydrogen-bonded to 
Cys63(H N ) and Leu59(0), respectively 

In contrast to protein-protein interactions, most 
protein-water hydrogen bonds found in the 2RN2 
structure are not maintained during the simulation. 
Several of these non-preserved interactions are 
mainly between C-terminal residues and water 
molecules, as well as all hydrogen bonds of Glul47 
and residues C-terminal to TyrlSl with water 
molecules. However, a number of water molecules 
still play a structural role in the dynamics average 
structure. In many cases, these water molecules 
bridge different secondary structure elements, 
thereby contributing to the overall tightening of the 
protein conformation (see Table 2). 

Protein-protein and protein-water hydrogen 
bonds involving backbone amide nitrogens (and 
carbonyl oxygens by virtue of the peptide bond) are 
of particular interest in this study, as they provide 
insight into the physical basis of backbone dynamics 
observed by NMR. Most of the backbone-protein 
hydrogen bonds observed in the 2RN2 structure 
persist throughout the trajectory The exceptions 
involve mostly hydrophobic residues in helices and 
loops and charged residues in the vicinity of 
hydrogen bonding contacts with another molecule 
in the 2RN2 lattice (e.g. Tyr28, Arg31 and Asp 148) 
as well as Hisl27, which undergoes a confor- 
mational change, probably due to its positively 
charged state in the simulation. In contrast, many 
backbone-crystal water hydrogen bonds are not 
preserved, although in several cases, the 2RN2 
water molecule involved in the hydrogen bond is 
either replaced by another crystallographic or a bulk 
water molecule. Furthermore, several hydrogen 
bonds involving crystal water molecules occur in 
the dynamics average but not in the 2RN2 structure. 



Isotropic B-factors 

Both theoretical and experimental B-factors of the 
N, C a , and C(=0) backbone atoms are averaged for 
each secondary structure element and they are 
listed in Table 1. The experimental values are taken 
from PDB entries 2RN2 (Katayanagi eta/., 1992) and 
1RNH (Yang et al, 1990), whereas the calculated 
values are based on the 100 to 500 ps portion of the 
trajectory. The average B-factors correlate with 
secondary structure, Residues in P-strands and a 
helices have generally lower mean-square displace- 
ments (B-factors) than residues in non-regular 
structures; the termini of the polypeptide chain 
have very high mean-square displacements, in 
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Figure 3. Representative re-orien- 
tational auto-correlation functions 
for the backbone N-H bond from 
each of the four classes described 
in the text, (a) Met50 (class I), (b) 
Lys96 (class II). (c) His 124 (class III), 
(d) GIyl23 (class IV). 



accord with the finding that the electron densities 
for these regions are very weak in the 2RN2 
structure and are absent in the 1RNH structure. 
Quantitative agreement (within the experimental 
uncertainty) is obtained between the 1RNH and 
computed average B-factors for the first three 
helices, oeA ( ocB, and ocC (Table 1). Apart from 
the terminal and handle regions, the computed 
average B-factors are generally lower than the X-ray 
values, as found in earlier studies (Petsko & Ringe, 
1984; Post, 1989; Chandrasekhar et a/., 1992); 
possible reasons for this trend are presented in 
Discussion. 



N-H Re-orientational auto-correlation 
functions 

The re-orientational auto-correlation functions for 
the backbone N-H bond of 149 out of the 155 
residues (i.e. excluding the N terminus and five 
prolines) have been calculated from the molecular 
dynamics trajectory over the period 100 to 500 ps. 
They can roughly be categorized into four classes. 

(1) Class I: correlation function decay within 20 ps 
of correlated time to a well defined plateau value. 

(2) Class II: decay to a plateau value within a 
correlated time of more than 20 ps. (3) Class III: 
initial decay to a plateau value, then further decay 
with no plateau reached. (4) Class IV: no plateau 
reached within 200 ps correlated time. Representa- 
tive correlation functions from each class are shown 
in Figure 3. The correlation function of Gly 126 is the 
most erratic: large oscillations are exhibited 
throughout the correlated time. Out of the 115 
class I residues, 102 belong to regular secondary 
structure elements, i.e. oc-helix or P-strand. In 
contrast, only five of the 34 residues whose 
correlation functions belong to classes II to IV (all of 
them in class II) belong to an oc-helix or P-strand. In 
the handle region, residues Lys91, Thr92 and 
Asp94-Lys96 belong to class II, whereas Ala93 
belongs to class III. 



Generalized order parameters 

The generalized order parameters S 2 MD of the 149 
backbone N-H bonds calculated from the 100 to 
500 ps portion of the molecular dynamics trajectory 
are listed in Table 3 and compared with correspond- 
ing experimental data (S Z NMR ) in Figure 4 (Mandel 
et al. t 1995). S Z MD values and their uncertainties for 
N-H pairs with correlation functions that did not 
reach a well-defined plateau value within 200 ps 
correlated time are listed with a question mark 
beside the value in Table 3. For 48 out of 116 
residues quantified experimentally, the S 2 MD differ by 
more than 0.055 from S 2 NMR \ possible reasons for this 
discrepancy will be presented in Discussion. 
Discord with experimental values is most pro- 
nounced (greater than 0.2) for residues 2-3, 29 to 31, 
71, 126 and 155 (Figure 4). 

Generalized order parameters of the backbone 
C a -H a atom pairs of 141 residues (14 glycines 
excepted) have also been computed using equation 
(6) and they are compared to the N-H S Z MD values 
in Figure 5. The C a -H a S Z MD values are generally 
higher than corresponding N-H values throughout 
the primary sequence: the average differences for P 
strands and a helices are 0.061 and 0.040, 
respectively (For only seven residues, Leu 14, Arg27, 
Tyr28, Met50, Val54, Lys60 and Gln76, are the 
differences between the two sets of S Z MD values 
smaller than 0.01.) The same trend has also been 
observed in molecular dynamics studies of a zinc 
finger peptide (Palmer & Case, 1992) and BPTI 
(Smith et al. t 1995). The restricted mobility of C a -H a 
relative to the N-H bond may reflect the fact that 
the peptide group can rotate relatively freely with 
little energetic cost to the rest of the backbone 
conformation, whereas re-orientation of the C a -H a 
bond necessarily affects the conformation of the 
attached (sometimes bulky) side-chain, which, in 
turn, may be involved in interactions that are crucial 
for the overall molecular fold. Futhermore, the 
C a -H a Smd values are generally more uniform than 
the corresponding N-H values, due to the absence 
of hydrogen bonding propensity 
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REMARKS 

L Examiner Interview 

On April 11, 2006, Applicant's representative, Stephanie Amoroso, conducted a tele- 
phonic interview in connection with the subject application with Examiner Lori A. Clow. Ac- 
cording to Applicant's file, during the interview, Applicant's representative and the Examiner dis- 
cussed the prior art of record and possible claim amendments to clarify the differences between 
the prior art of record and claim 1. No agreement on allowable subject matter was reached. 

TL Claim Status 

Claims 2-5 have been cancelled without prejudice or disclaimer subsequent to being 
withdrawn from consideration by the Examiner as being directed to non-elected subject matter. 
Applicant reserves the right to pursue the cancelled claims in one or more divisional applications. 

Claims 1 and 26, 27, 29 and 30 have been amended to be directed to methods that call 
for " utilizing the calculated dimensions, chemical and/or electrostatic properties obtained in step b 
to identify compounds that contain functional groups that can be accommodated [by a cavity on a 
target protein]." Claim 28 has been similarly amended to be directed a method that calls for 
" utilizing the calculated dimensions of step b or the chemical and/or electrostatic properties ob- 
tained in step c to identify compounds that contain functional groups that can be accommodated 
[by a cavity on a target protein]. " Upon reading the specification, it is self evident that the calcu- 
lated physical properties of a cavity are used to identify compounds that include functional groups 
that may be accommodated in the cavity. To the extent that further support is required for "utiliz- 
ing" such information to identify compounds that bind within an cavity that has been identified on 
a target protein, such support is found at, e.g., page 12, line 21 et seq., page 18, line 3, et seq., 
page 25, lines 11-14, and page 27, lines 24-25. 

The further amendments to claim 1 are for the purpose of clarity only and do not 
change the scope or meaning of claim 1. Thus, these further amendments to claim 1 do not add 
new matter to the application. 
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By this Amendment, no new matter is added to the application. 
Upon entry of this Amendment, claims 1 and 6- 31 are pending. 
III. Response to Rejections 

The rejections set forth in the Final Office Action are summarized and addressed as 

follows. 

(i) Rejections Under 35 U.S.C. §112, first paragraph (written description). The Ex- 
aminer has rejected claims 25 and 31 as allegedly including new matter. The rejections are trav- 
ersed as follows. 

With respect to claim 25, the Examiner's position is that the specification fails to pro- 
vide adequate written description for identifying an allosteric cavity of a target protein using nu- 
clear magnetic resonance, crystal structure analysis, calorimetric values from thermodynamic 
studies, or computer modeling. The Examiner's position is believed to be mistaken. The specifi- 
cation at page 11, lines 3-5 provides explicit support for using crystal structure analysis, nuclear 
magnetic resonance, and computer modeling to identify a cavity. Thus, inclusion of these meth- 
ods in claim 25 does not constitute new matter. With regard to identifying a cavity using calo- 
rimetric values from thermodynamic studies, the specification at page 11, lines 3-4 sets forth that 
the "cavity of the target protein may be identified by any of several well known techniques" (em- 
phasis added) and states explicitly that the methods recited with reference to identifying a cavity 
are not limiting. The specification at page 10, lines 13-21 discloses that the functionally critical 
site of a target protein may be inferred from microcalorimetric analysis. Thus, the specification 
identifies microcalorimetric analysis as a means to analyze protein structure. The disclosure that 
"any" well known technique may be used to identify a cavity on a target protein, coupled with the 
explicit disclosure of that microcalorimetric analysis may be used to determine a particular site, 
i.e., a functionally critical site, is sufficient to show that the inventors had possession of using 
calorimetric analysis to identify a cavity. Thus, the specification provides adequate written de- 
scription for the inclusion of "calorimetric values from thermodynamic studies" in claim 25. Ac- 
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cordingly, the specification includes adequate written description for all of the methods of identi- 
fying a cavity that are recited in claim 25. The present rejection should be withdrawn. 

With regard to claim 31, the Examiner's position is that the specification fails to pro- 
vide written description for identifying an allosteric cavity using a technique that includes identify- 
ing thermal p-factors. In first response, Applicant notes again that the specification discloses that 
"any" well known method may be used to identify a cavity (specification at page 11, lines 2-3) 
and note further that a means of identifying a functionally critical site of a target protein include, 
"identification of p-factors on the target protein structure as imaged using crystal or nuclear mag- 
netic resonance (NMR) images, either by thermal p-factors on the atoms of target protein from 
crystal structure or flexible loops inferred from NMR signals" {see specification at page 10, lines 
13-18). See also specification at page 10, lines 20-25 for further discussion of p-factors. When 
taken together, these passages show possession of using identification of using NMR or crystal 
structure analysis and further comprising identifying thermal p-factors to identify a cavity. Thus, 
for at least this reason, the present rejection should be withdrawn. 

Moreover, with further regard to "p-factors," one of ordinary skill in the art would 
understand that identification of p-factors are a basic aspect of crystallography that is part of every 
crystal study and that p-factors may be calculated from NMR studies. The Examiner is directed, 
for example, to an enclosed publication, Philippopoulos et al., J. Mol. Biol., 1995 254:771-792, 
which describes the derivation of p-factors by crystallography and NMR. Thus, one of ordinary 
skill in the art would understand that the disclosure that a cavity on a target protein may be identi- 
fied by "crystal structure analysis [or] NMR" (see page 11, line 4) is sufficient to show posses- 
sion of identifying an allosteric cavity "using nuclear magnetic resonance or crystal structure 
analysis, and further comprising] identifying thermal p-factors," as called for in claim 31. For 
this reason additionally, the specification provides adequate written description for claim 31. 

For at least the reasons set forth above, the specification provides adequate written de- 
scription for the full scope of claims 25 and 31. Reconsideration of claims 25 and 31 and the 
withdrawal of the rejection thereof for containing new matter is requested, accordingly. 
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S 2 N mr in the order parameter determined by NMR (Mandel et a/., 1995; in cases for which both S 2 
and Sj have been determined experimentally, only Sj, the order parameter of motions with correlation 
times \int < 50 ps, is shown); S Z MD is the order parameter computed from the simulation. A question mark 
before the value indicates that the N-H correlation function did not reach a well-defined plateau value 
within 200 ps correlated time, i/w (NMR) is the effective correlation time of internal motion determined 
by NMR (in some cases it reflects motion on a timescale that is longer than the simulation length, e.g. 
Arg31). ijnr(MD) is the correlation time of internal motion computed from the simulation. Uncertainties 
of all quantities are shown in parentheses, theoretical uncertainties are determined by the jackknife 
method described in Methods, and values < 0.005 are not shown. Only Tto(MD) values higher than 
1 ps are shown, xmt (MD) values whose theoretical uncertainty is equal to or higher than their absolute 
value (due to extreme noise in the auto-correlation function) are not shown. 



Effective correlation times 

Calculated effective correlation times, T int , based 
on the 400 ps production trajectory are listed in 
Table 3. Only x int values greater than 1 ps are shown, 
due to the coordinate sampling interval of 100 fs 
(see Methods) that has resulted in re-orientational 
correlation functions with low precision on the fs 
timescale. The T im values were computed from 
equation (7), with T defined as the point at which 
the correlation function matched the corresponding 
Smd value (calculated from equation (6)) to within 
0.001. For the majority of residues that have been 
experimentally assigned a motional timescale 
(Mandel et ai., 1995), x int values seldom exceed 20 ps 
(the sensitivity lower limit of the NMR experiment). 
For this reason, the choice of 40 ps for the block size 
used in the "jackknife" algorithm (see Methods) for 
estimation of the statistical uncertainties of S 2 and 
Ti„t values from the MD trajectory is justified in most 
cases. 

A relatively small number of T im values were 
determined experimentally, due to the conservative 
nature of the fitting protocol and low sample 
concentrations that gave rise to nuclear Overhauser 
enhancement (NOE) values of low precision 
(Mandel et ai., 1995). This fact, along with the 
limited precision to which T tnt values can be 
computed from the 400 ps trajectory, makes 
comparison between theoretical and experimental 
ti„t values problematic. However, there are some 
interesting patterns. Most otC residues (positions 83 
to 88) are assigned a i/„ tf in contrast to neighbouring 
otB (Table 3). This behaviour is consistent with the 
relatively high B-factors of residues 83 to 88 



(Table 1). Most residues of termini, loops and turns 
possess a T in t, which is computed to be lower than 
25 ps in most cases. Finally long stretches of 
residues in ocA, aB and ocD (positions 104 to 109) are 
not assigned a motional timescale x jm from the 
present simulation, indicating severely restricted 
librational motion of these N-H bonds with 

Tim < 1 PS. 

Discussion 

Comparison with X-ray data 

X-ray isotropic B-factors reflect protein mobility 
on two distinct timescales: 10" 15 to 10~ n seconds (fast 
thermal fluctuations) and 10~ 9 to 10 3 seconds 
(conformational changes). Since only highly popu- 
lated conformers are observable via X-ray diffrac- 
tion, the method is relatively insensitive to motions, 
such as domain hinge-bending or aromatic ring 
flips, that occur either too slowly or too infrequently 
to yield distinct conformers and hence detectable 
electron densities (Petsko & Ringe, 1984). Currently 
only motions on the 10" 15 to 10" 10 seconds timescale 
are observable in common by X-ray diffraction and 
molecular dynamics simulations; this type of 
fluctuation is expected to dominate the backbone 
dynamics of regular secondary structure elements 
that are part of tightly packed protein hydrophobic 
cores. In general, the observed mean-square atomic 
displacement, {x 2 ), can be written as a sum of three 
contributions (Petsko & Ringe, 1984); 

<*> = OOc +<**>/ (1) 
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Figure 4. (a) Generalized order parameters S 2 of the backbone N-H bonds calculated from the 100 to 500 ps portion 
of the molecular dynamics trajectory (b) Corresponding NMR data, (c) Difference between calculated and experimental 
values. 



where <x 2 > v is the contribution due to thermal 
fluctuations (vibrations within a single harmonic 
potential well), <x 2 > c is the contribution due to static 
and dynamic conformational disorder (molecules 
existing in different conformations in the crystal) 
and <x^>/ is the contribution due to lattice disorder 
(crystal heterogeneity). 

To make a quantitative comparison of the X-ray 
B-factors with the computed values, an attempt to 
subtract out the lattice disorder contribution was 
made by translating the experimental values so that 
the lowest value was equal to the corresponding 
theoretical value. Thus, the 2RN2 data were shifted 
by 3.71 A 2 downward, whereas the 1RNH data were 
not translated, as the lowest 1RNH value was lower 
than the corresponding computed one. The com- 
parative profiles of the calculated and experimental 
main-chain B-factors averaged on a per residue 
basis in Figure 6 show that the overall qualitative 
features of the experimental data are reproduced 
well by the simulation. The largest discrepancy 
between theory and experiment is found in the 
reverse turn of the handle, where the B-factors of 
residues 92 to 95, especially Ala93, are much higher 



than the corresponding X-ray values. On the other 
hand, there is good agreement between S Z NMR and 
Smd values for these residues (Table 3). Since Lys91, 
Ala93 and Asp94 are involved in a number of 
intermolecular hydrogen bonding contacts in the 
2RN2 structure, the removal of these constraints 
during molecular dynamics may have resulted in 
higher atomic fluctuations for the reverse turn 
relative to those observed in the X-ray structure. In 
fact, this region underwent a significant displace- 
ment (by about 5 A with respect to the 2RN2 
coordinates) in a direction away from the protein 
core during equilibration. As a result, this region 
became exposed to vacuum, thereby undergoing 
large atomic fluctuations relative to the rest of the 
protein. Therefore, the above crystal contacts may 
have had an important effect on the structure and 
dynamics of the reverse turn. 

Apart from the handle region and the termini, the 
computed average B-factors are generally lower 
than the X-ray values. Some possible reasons for the 
observed systematic error are: (1) the least-squares 
fitting procedure employed in separating internal 
motions from overall molecular diffusion (inadver- 
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tently removing some internal motion). (2) The 
limited timescale of the simulation (limiting 
exploration of conformational space, thus decreas- 
ing the computed values). (3) Inadequacies in the 
empirical force-field (leading to an underestimation 
of mobility). (4) Under-correction of the lattice 
disorder in the crystal (increasing the experimental 
values). (5) Under-correction of the reflections for 
radiation damage (causing systematic overestima- 
tion of B-factors; Petsko & Ringe, 1984). (6) Phonon 
vibrations in the crystal that would be absent in a 
solution environment (resulting in enhanced X-ray 
B-factors). 

Comparison of B-factors from the simulation and 
the 1RNH structure reveals close agreement for 
several residues in aA, ocB and aC, which also 
contain some residues with X-ray B-factors that are 
lower than the computed values. Thus, the 
least-squares fitting procedure employed in (1), 
above, does not seem to be a significant contribution 
to the observed discrepancy Comparison of the 
computed B-factors with those from a 200 ps 
production trajectory yields significant differences 
only for some of the turns (such as residue 60), 
whose B-factors are increased upon doubling the 
trajectory; this, in conjunction with the NMR 
finding of internal correlation times only in irregular 
secondary structure elements (Mandel et aJ., 1995), 
suggests that the limited timescale of the simulation 
as in (2), above, contributes to at least part of the 
observed difference between computed and X-ray 
B-factors for the loops, turns and termini. 



However, it seems unlikely that the significant 
differences between experimental and computed 
B-factors of regular secondary structure elements 
stem from conformational disorder in the crystal 
structure not sampled by the present trajectory 
Such large-scale motions would probably be 
inhibited, especially in the tightly packed hydro- 
phobic core of strands pi to 4. Comparison of the 
1RNH and 2RN2 values (Figure 6 and Table 1) 
suggests that the different experimental procedures 
(multiwavelength anomalous diffraction versus 
multiple isomorphous replacement, respectively) 
and refinement protocols (Yang et ai., 1990; 
Katayanagi et ai., 1990) may contribute to the 
observed discrepancy Furthermore, for a few 
proteins like ribonuclease A (Petsko, 1975), crambin 
(Teeter & Hendrickson, 1979), and avian pancreatic 
polypeptide (Glover et a/., 1983), which diffract to 
resolutions comparable to that of small molecule 
crystals, i.e. 1 A or higher, a number of atoms 
have B-factors as low as 3-4 A 2 , similar to those 
observed for atoms in small organic crystal 
structures. Other small, well-diffracting proteins 
like rubredoxin (solved to 1.2 A resolution) have 
B-factors as low as 7 A 2 (Petsko & Ringe, 1984), 
comparable to the average computed B-factors 
of the (J strands (6.3 A 2 ) and a helices (7.3 A 2 ) of 
RNase Hi. Taken together, the large differences 
between the 1RNH and 2RN2 values, as well as 
the relatively high 2RN2 B-factors for the 
regular secondary structure elements, indicate that 
the observed discrepancy in the P strands and a 




Residue number 

Figure 5. Comparison of the generalized order parameters S 2 of the 141 backone C a -H (broken line) versus N-H bonds 
(continuous line) calculated from the 100 to 500 ps portion of the molecular dynamics trajectory 
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Residue number 

Figure 6. (a) Comparison of calculated main-chain E-factors averaged on a per residue basis (broken line) with 
experimental values (continuous line) taken from PDB entries 2RN2. (b) With 1RNH. The 2RN2 values in (a) are 
downshifted by 3.71 A 2 (see the text). 



helices may result from factors (4) to (6) stated 
above. 

Comparison with NMR data 

In analogy to the computed B-factors, the S Z MD 
values are generally overestimated relative to the 
Snmr values of most residues (see Figure 4). The 
major cause of the observed "systematic" error does 
not stem from the least-squares fitting procedure 
employed in removing tumbling, since correlation 
functions in which overall tumbling was not 
removed were very similar to those obtained after 
the least-squares fitting (plateau values were within 
0.02); this indicates that on the simulation time- 
scale, the computed order parameters are insensi- 
tive to the manner in which overall translational and 
rotational diffusion is treated. Another possible 
reason for the discrepancy may stem from the 
assumption of a fixed N-H bond length of 1.02 A 
inherent in the experimental fitting procedure by 
which NMR relaxation data were fitted to spectral 
density models containing one, two or three 
model-free dynamical parameters (Mandel et a/., 
1995). In contrast, bond oscillations have been 



allowed to contribute to the Sm D values presented in 
Table 3, as the SHAKE algorithm (van Gunsteren & 
Berendsen, 1977) that permits longer simulation 
time steps by fixing covalent bonds to hydrogen 
atoms was not used in the present work. 
Comparison between S Z MD values shown in Table 3 
and values computed with a fixed N-H distance of 
1.02 A shows that the latter are higher by about 
0.005. This is, therefore, not a significant contri- 
bution to the observed differences. A more 
significant contribution may be due to the neglect of 
zero-point vibrational motion of the N-H and C a -H 
bond. This is suggested by backbone heteronuclear 
S 2 values for BPTI (Bruschweiler, 1992) and a 
zinc-finger peptide (Palmer & Case, 1992). A 
systematic difference between classical statistics (as 
in this work) and quantum statistics was found to 
be between 5% and 6%, independent of the amino 
acid type and position in the protein; i.e. order 
parameters from a classical analysis were greater 
than those from a quantum mechanical analysis by 
an average of 0.05. In the following, possible reasons 
for significant discrepancies between MD- and 
NMR-derived order parameters (greater than 0.05), 
as well as the physical basis of unusual or 
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interesting patterns of the observed dynamics, are 
discussed. 

ot-Helices and ^-strands 

Table 4 lists average S 2 values per secondary 
structure element and compares them with values 
determined by NMR (Mandel et aL, 1995). Average 
S Z MD values of residues in p strands and a helices 
typically fluctuate around 0.89(±0.02). There is 
reasonably good agreement between the theoretical 
and experimental average S 2 values of the a helices 
and (3 strands: the difference between the two sets 
of mean S 2 values is less than 0.05 (the estimated 
correction for zero-point vibrational motion) for 
four of the five P strands (pi to 4) and three of the 
five a helices (aB, aC, and otE). It is somewhat higher 
(0.06-0.07) for otA and aD, and is as high as 0.10 for 
P5. In all cases, the mean S 2 values for the regular 
secondary elements are overestimated relative to 
experiment. 

P5 strand 
The average SIimr 

of p5 (0.77(±0.01)) is unusual, 
although P-sheet S 2 values in the neighbourhood of 
0.80 have been reported for other proteins (Clore 
etaL, 1990; Stone eta/., 1992; Farrow etai, 1994). The 
NMR relaxation study (Mandel et a/., 1995) found 
that P5 residues (116 to 120) exhibit a pattern of 



Table 4. Weighted mean order parameters for secondary 
structure elements 



Secondary 


Residue 






Residues 


structure 


range 


SltMR 


Smd 


quantified 


PI 


4-13 


0.85 (0.01) 


0.88 


7 


P2 


18-27 


0.87 (0.01) 


0.89 


9 


P 


32-42 


0.85 (0.01) 


0.87 


10 


P4 


64-69 


0.85 (0.01) 


0.90 


5 


P5 


115-120 


0.77 (0.01) 


0.87 


5 


<P> 




0.84 


0.88 




aA 


43-58 


0.86 (0.01) 


0.91 


9 


aB 


71-80 


0.91 (0.01) 


0.91 


4 


aC 


81-88 


0.87 (0.01) 


0.90 


6 


aD 


100-112 


0.83 (0.01) 


0.90 


11 


aE 


127-142 


0.89 (0.01) 


0.90 


11 


<ot> 




0.87 


0,91 




Turn) 2 


14-17 


0.70 (0.01) 


0.85 


3 


Turn23 


28-31 


0.86 (0.01) 


0.66 (0.02) 


4 


LoopA^ 


59-63 


0.77 (0.01) 


0,86 (0.01) 


3 


Handleco 


89-99 


0.81 


0.85 


10 


LoopsE 


121-126 


0.80 (0.02) 


0.82 (0.01) 


5 


(bops) 




0.79 


0.84 




C Terminus 


143-155 


0.78 


0.82 


11 



The last column shows the number of residues in that element 
quantified experimentally In some instances. S 2 NM r values exist for 
a limited number of residues, e.g. in aB. S 2 M d values and their 
uncertainties were computed as weighted means, including only 
S 2 of residues that have been quantified experimentally. S% MR 
for a helices and p strands are taken from Mandel et a/. (1995), 
Table 4, whereas values for non-regular secondary structure 
elements are computed as weighted averages of S z from Table 1 
of the same reference. In cases where both an S 2 and an Sj value 
are reported experimentally, the Sj value has been chosen for 
the averaging. <p>, <a> and (bops) are average S 2 values over 
the five p strands, five a helices and five loops, respectively 



alternate high and low mobility with an average S 2 
of 0.74(±0.01) for Ilell6, Trpll8 and Trpl20 and 
0.84 (±0.02) for Lysll7 and Glull9. The simulation 
has reproduced this alternating motional pattern, 
and the S 2 MD values of Lysll7 and Glull9 are close 
to the corresponding experimental values. The 
differential mobility of the 05 residues correlates 
with the hydrogen bonding patterns of the backbone 
nitrogen atoms. The N-H mobility of Glnll5, 
Lysll7 and Glull9 is restricted, since their amide 
hydrogens interact with the carbonyl oxygens of 
Cys63, Val65 and Leu67, respectively In contrast, 
there are no stable protein or water hydrogen bonds 
to the backbone amide hydrogens of Ilell6, Trpll8 
and Trpl20, allowing their N-H bonds to exhibit 
greater mobility The only analogous structural 
situation to residues 116 to 120 occurs at the 
opposite edge of the P-sheet (strand 03), where the 
amide hydrogens of Glu32 ( Thr34 and Ser36 are not 
involved in hydrogen bonds, whereas those of 
Lys33 and Phe35 interact with the carbonyl oxygens 
of Leu26 and Ala24 of 02, respectively. However, an 
alternating dynamical pattern for these residues 
was not observed via NMR relaxation: the S 2 NMR of 
residues 32 to 36 are uniform (0.83(±0.02)). The 
simulation, in contrast, does indicate a pattern of 
alternate high mobility for Glu32, Thr34 and Ser36 
and lower mobility for Lys33 and Phe35 (see 
Table 3): in particular, the S 2 MD differences between 
Lys33 and Glu32 (0.09) and between Phe35 and 
Thr34 (0.06) are greater than the uncertainties. 

Coiled-coil interface 

Significant discrepancies between the NMR data 
and the simulation are seen for several residues in 
the coiled-coil interface between helices aA and aD 
(Yang et a/., 1990; Katayanagi et a/., 1992). The 
interactions between Arg46 of a A and AsnlOO and 
Asp 102 of aD, as well as between Glu57 of aA and 
Argl06 of aD observed in the 2RN2 structure, are 
all preserved in the simulation. NMR-derived N-H 
order parameters for this region indicate a pattern 
of reduced mobility for the charged residues Arg46, 
Aspl02 and Argl06 (mean S 2 NMR = 0.85(±0.01)) 
and of enhanced mobility for the hydrophobic resi- 
dues, Leu56, Leul03, Leul07 and Leulll (mean 
Snmj? = 0.80(±0.01)); no data have been reported for 
the other residues involved in stabilizing the 
interface, Leu49, Ile53 and Glu57. In contrast, the 
MD-derived backbone N-H order parameters for 
the charged residues, Arg46, Glu57, Aspl02 and 
Argl06 (mean S 2 MD = 0.91(±0.01)) are similar to 
those for residues involved in hydrophobic inter- 
actions between aA and aD: Leu49, 56, 103, 107, 1 1 1 
and Ile53 (mean S Z MD = 0.89(±0.01)). Relative to the 
experimental values, the S 2 MD values are overesti- 
mated by about 0.05 for the charged residues and 
roughly 0.1 for the hydrophobic ones. Although 
order parameters are only sensitive to ps-ns type 
motions, individual values may still be ensemble 
averages over several conformational substrates 
inter-converting on timescales longer than the ps-ns 
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timescale. As significant conformational exchange 
terms have been identified for most of the residues, 
especially all the leucines comprising oeA and otD 
(Mandel et al. t 1995), the lower S 2 observed 
experimentally may reflect the existence of ad- 
ditional less rigid conformations in equilibrium 
with the one sampled in the trajectory and 
interconverting on the jis-ms timescale. In particu- 
lar, there may be a coiled-coil conformational state 
where the hydrophobic residues stabilizing 
the interface may not be well packed (thus 
giving them an extra degree of mobility), but the 
overall structure of ocA and aD is maintained 
(thus not affecting the mobility of the charged 
residues). 



aC helix 

The S 2 NMR and S 2 MD values for ocC are in close 
agreement (Table 3). Both theory and experiment 
indicate that the mean aC S 2 values (Table 4) do not 
deviate from the average S 2 values expected for 
regular secondary structure, whereas the mean 
aC B-factors (Table 1) are the highest among 
all regular secondary structure elements (although 
the B-factors in 1RNH do not indicate high 
backbone mobility for aC). This difference 
between X-ray B-factors and NMR order par- 
ameters illustrates the variance in dynamical 
information that the two sets of data can provide. 
B-factors are directly related to atomic rms 
fluctuations. N-H order parameters, on the other 
hand, are only sensitive to angular fluctuations and 
the local environment of the N-H bond, i.e. 
proximity to hydrogen-bond partners, and may 
therefore reflect more localized motions as com- 
pared to B-factors. 

Several protein-protein hydrogen bonds involv- 
ing aC residues, not present in the X-ray structure, 
are formed during the simulation: Trp81(0)- 
Asn84(N), His83(0)-Lys87(N), Arg88(N e )-Lys91- 
(O) and Arg88(N Tll )-Trp90(O). The absence of the 
His83(0)-Lys87(N) interaction from the 2RN2 
structure (normally expected to be present as part 
of the a helix) may have been due to the 
intermolecular interactions of Trp85, Lys86, Lys87 
and Gly89 with GlnllS, Glyl23 and Hisl24 of an 
adjacent molecule (Katayanagi et a/., 1992). The 
presence of hydrogen bonding partners for the 
amide hydrogens of all the aC residues (except for 
His83) probably results in the relatively high S 2 
values of helix C. Interestingly, despite the 
additional interactions found in the simulation, 
the computed B-factors for the aC residues 
(especially 83 to 88) are higher than average 
B-factors for the other a helices; this trend is in 
accord with the 2RN2 B-factors. Thus, the observed 
high backbone B-factors of aC may be the result of 
collective translational displacements of the helix, in 
which the individual N-H bonds preserve their 
orientations, thus accounting for the relatively high 
S 2 NMR and S 2 MD values. 



Loops and the C terminus 

The average S 2 of the two isolated reverse turns 
(residues 14 to 17 and 28 to 31), two loops (residues 
59 to 63 and 121 to 126), the handle region (residues 
89 to 99) and the C-terminal region (residues 143 to 
155) are listed in Table 4 and compared with 
corresponding experimental values. The average S 2 
values of residues in turns, loops and the C 
terminus are lower than those in the P strands and 
a helices. This is consistent with the fact that an S 2 
and /or a motional timescale have been assigned 
experimentally to residues belonging exclusively to 
non-regular secondary structure elements, reflect- 
ing the enhanced flexibility of these regions. For 
two of the three loops that have been implicated in 
binding, the p5-aE loop and the handle region, as 
well as the C-terminal region, the average theoreti- 
cal and experimental S 2 values are in reasonable 
accord (within 0.04). Agreement between theory 
and experiment for the loop between ocA and (34 is 
better than the average S 2 values indicate: two of the 
three Sn MR values (for Leu59 and Glu61) are in 
accord with the individual S 2 MD values (within the 
uncertainties) and only the S 2 NMR of His62, which had 
the second lowest goodness-of-fit (11.83) in the 
protein, was found to differ significantly from the 
computed number (by 0.06). However, the (32-03 
turn, which does not contain any residues 
implicated in binding and/or catalysis, and the 
(31-(32 turn, have average S 2 MD values that differ 
significantly from the corresponding S 2 NMR values 
(by 0.14 and 0.20, respectively); the discrepancy is 
probably due to errors in the simulation (insufficient 
sampling and possibly inadequate solvation) and/ 
or the experimental fitting protocol (low goodness- 
of-fit values). 

Handle region 

Agreement between the S 2 NMR and S 2 MD values for 
the handle region is one of the best in the protein, 
as shown in Tables 3 and 4. Trp90 has the highest 
Snmr and S 2 md value in the handle region. Consistent 
with the NMR data, the X-ray backbone tempera- 
ture factors of Trp90 are among the lowest for 
residues 81 to 99 in the 2RN2 structure (Katayanagi 
et a/. t 1992), whereas in the 1RNH structure (Yang 
et ai., 1990) rigidity is less exclusive to Trp90. In the 
2RN2 structure, Trp90 interacts with aC and the 
handle region on the other side of the reverse turn 
via two hydrogen bonds, Trp90(H N )-Trp85(O) and 
Trp90(O)-Val98(H N ) (Katayanagi et a/., 1992). Only 
the former interaction is maintained during the 
simulation, so that the latter is not responsible for 
the NMR-observed rigidity of the Trp90 backbone 
(Mandel et ai., 1995). In addition, the carbonyl 
oxygen of Gly89 forms a hydrogen bond with 
Lys91(H N ), thereby further rigidifying the Trp90 
N-H vector through the Gly89-Trp90 peptide bond. 
The Gly89(0)-Lys91(H N ) hydrogen bond is not 
seen in the 2RN2 crystal structure (Katayanagi 
et ai., 1992). Note that the Trp90(H N )-Trp85(O) and 
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Figure 7. (a) The scalar products of the Glyl23 and (b) Alal25 N-H bonds relative to their orientation at t = 0 of the 
production trajectory as a function of simulation time. 



Gly89(0)-Lys91(H N ) interactions do not appear 
to be screened by solvent and furthermore, 
the backbone and side-chain atoms of Trp90 do 
not form stable hydrogen bonds, with water 
molecules. 

|J5-aE loop 

The average S? NMR and S 2 MD values of this loop are 
within the experimental uncertainty (Table 4); the 
biggest discrepancy in the two sets of values is due 
to Gly 126. whose N-H auto-correlation function did 
not decay to a plateau value, causing its S 2 MD to differ 
significantly from the experiment. The origin of 
the relatively significant T/ nf values (162(± 128) and 
82(±36)ps) found experimentally for Glyl23 and 
Alal25 (Mandel etaL, 1995), the only residues in the 
P5-aE loop assigned a motional timescale, may be 
linked to a concerted transition in the scalar 
products of the Glyl23 and Alal25 N-H bonds 
relative to their initial orientation at about 80 ps into 
the production trajectory (Figure 7). This transition 
coincides with sharp transitions at 80 to 85 ps in 
the Lysl22(0)-Alal25(N) distance as well as the 
Glyl23 4> and \|/ dihedral angles (Figure 8). At 
r = 85 ps, a water molecule (Bul3397) forms a 



hydrogen bond with Alal25(H N ). Another water 
molecule (Bul3586) comes within hydrogen-bond- 
ing distance of Hisl24(0) at t=90ps, and at 
t= 112 ps, the same water molecule also forms a 
hydrogen bond with Glul31(O e2 ). Both these 
interactions remain stable for the remainder of the 
simulation. The water bridge Hisl24(0)-Bul3586- 
Glul31(O e2 ) (and to a lesser extent, Alal25(H N )- 
Bul3397) is responsible for restricting the mobility 
of the Alal25 N-H bond, due to the partially 
double-bond nature of the peptide bond connecting 
Hisl24 C = 0 and Alal25 N-H. Finally a third 
water molecule (Bul3315) comes to contribute to the 
stabilization of the backbone conformation of His24 
and Alal25 by forming hydrogen bonds simul- 
taneously with their amide hydrogen atoms at 
approximately t = 250 ps. As a precaution against 
artifacts due to insufficient equilibration giving rise 
to the transition at ca 85 ps, the S 2 of Gly 123 was 
recalculated from the last 300 ps of the trajectory 
and found to be 0.77(±0.03), i.e. much higher 
than the value of 0.57(±0.12) calculated including 
the transition and the experimental value of 
0.54(±0.07). As there are no exchange terms found 
in the p5-otE loop, implying that ps-ns motions 
are probably the only contribution, this transition 
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appears to account for the S Z NMR values of Glyl23 and 
Alal25, and thus the dynamics of this loop. 



Conserved residues 

Of the seven most highly conserved residues in 
RNase Hi, backbone dynamical parameters have 
been determined experimentally only for Asp 10, 
Ser71, Asp 130 and Aspl34 (Mandel et ai., 1995); the 
other three catalytically important residues could 
not be quantified due to exchange line broadening 
(for Glu48 and Asp70) and resonance overlap (for 
Hisl24). Only Asnl30 was found to have an 
unexceptional S 2 NMR (0.83( + 0.05)), and the only one 
in accord with theory (S 2 MD = 0.89(±0.01)). 



Acidic residues 

The S 2 N mr value for Asp 10 has the highest 
uncertainty in the protein (0.72 (±0.1 3)), whereas 
the S 2 NM r value for Asp 134 is the highest in the 
protein (1.00(±0.01)). The S 2 MD value for AsplO 



(0.88), near the upper limit of the experimental 
number (0.85), is consistent with the occurrence of 
this residue as part of the regular secondary 
structure and its low computed and X-ray B-factors; 
thus, this suggests that the S 2 NMR value may be an 
overestimate of backbone mobility The S 2 MD value of 
Asp 134 (0.92) is significantly lower than the S 2 NMR 
value. In addition, the S 2 MD values of neighbouring 
Cysl33 (0.92), Glul35 (0.90) and Leul36 (0.91) are 
also lower than the S 2 NMR value of Asp 134. Since 
the S 2 MD values of Cysl33 and Leu 136 (Glul35 was 
not quantified experimentally) are close to their 
experimental values, it is possible that the S 2 NMR 
value of Aspl34 may be in error. Glu48 possesses 
the highest S 2 MD value (0.93) among the four 
absolutely conserved acidic residues whereas 
Asp70, the single residue between 04 and otB, has 
a S 2 md (0.90) in between the average S 2 MD values of a 
P strand and an a helix (Table 4). The observed 
relative rigidity of the four absolutely conserved 
acidic residues (average S 2 MD = 0.90) is probably due 
to the hydrogen bonding contacts made by the 
backbone nitrogen, the preceding carbonyl oxygen 
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Figure 8. The distance time-series of Lysl22(0)-Alal25(N) and the $ and v|/ dihedral angles of Glyl23 as a function 
of simulation time plotted in panels (a), (b) and (c), respectively 
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and perhaps the side-chain carboxylate oxygens 
(Table 2). 

Ser71 

This has one of the lowest S Z NMR values 
(0.60(±0.08)) and highest chemical exchange terms 
(4.6(+1.4) s" 1 ) in the protein, indicating substantial 
flexibility on the ps-ns and us-ms timescale, 
respectively (Mandel et aL, 1995). On the other 
hand, a hydrogen bond between Ser71(N) and 
Thr69(O yI ) is observed in both the 1RNH and 2RN2 
structures (Yang et aL, 1990; Katayanagi et aL, 1992). 
This bond is stable during the simulation, giving rise 
to a Smd value of 0.88( + 0.01), consistent with the 
restricted motion of an N-H bond in its hydrogen 
bonding state. There seems to be a structural 
difference in the environment of Ser71 in going 
from the crystal to the solution state related to the 
side-chain conformation of Thr69. This may be 
caused by crystal contacts in the 2RN2 structure, as 
there are two intermolecular hydrogen bonds in the 
crystal lattice connecting neighbouring Asp70 
carboxylate oxygens to Lys87(N ? ) of an adjacent 
molecule (Katayanagi et aL, 1992). 

His 124 

There are no experimental data for this catalyti- 
cally important residue; its auto-correlation func- 
tion shows a class Ill-type behaviour (Figure 3) 
yielding a S 2 MD value of 0.68(±0.02). This is midway 
between the S Z MD values of Glyl23 and Alal25, 
which are in agreement with the corresponding S Z NMR 
values to within experimental uncertainty (Table 3). 
The flexibility of the His 124 N-H bond is consistent 
with the finding that its backbone (and side-chain) 
atoms do not interact with any protein atoms, 
although they do form and break hydrogen bonds 
with several water molecules during the simulation. 
The backbone dihedral angles cj) and \|/ of Hisl24 
underwent no transitions during the simulation, in 
contrast to those of Glyl23 and Ala 125. However, a 
significant displacement of the His 124 side-chain 
with respect to its 2RN2 coordinates occurred 
during equilibration (see Figure 10); this may be due 
to the relief of several intermolecular crystal 
contacts involving the backbone and side-chain 
nitrogens of His 124. 

Interestingly the catalytically important Hisl24 
adopts different conformations with respect to 
the side-chains of AsplO, Glu48 and Asp70 in 
the crystal structures of the free enzyme (Yang 
et aL, 1990; Katayanagi et al., 1992) and the 
active immobilized-metal-affinity-chromatography 
(IMAC) HIV-1 RNase H domain (Chattopadhyay 
eta/., 1993). In the 2RN2 structure (Katayanagi etal., 
1990) the imidazole of His 124 is roughly 10 A away 
from the active site residues, whereas in the 1RNH 
structure (Yang et aL, 1990), it is within 4 A of the 
AsplO and Asp70 carboxylates (see Figure 9). The 
conformation of the imidazole in the IMAC 
structure is similar to that in 2RN2, in that it is 




Figure 9. Hydrogen-bond network in the active site of 
the dynamics-averaged structure. Hydrogen bonds are 
represented as broken lines. For clarity the water 
molecules are not labelled. 



significantly displaced from the active site (Chatto- 
padhyay et a/., 1993). Crystal contacts between 
AsplO and Asp70 with Lys87 of a neighbouring 
molecule in the 2RN2 structure may be responsible 
for the side-chain conformation of Hisl24. However, 
crystal packing is found not to be responsible for the 
IMAC conformation. From ['H]NMR pH titration 
experiments (Oda et al., 1993a), the pK a value of 
Hisl24 was found to be 7.1. Since this value is 
slightly higher than the "normal" p/C a value of an 
imidazole group (around 6.5 for His residues in 
denatured barnase), His 124 was suggested to be in 
an acidic environment; however, as the pK a of 
Hisl24 is lower than that of Hisl27 (7.9), whose 
side-chain N e2 hydrogen bonds to Glull9(O el ), 
His 124 was thought not to interact directly with the 
acidic side-chains (Oda et ai., 1993a); hence, it was 
concluded that it adopted a solution conformation 
similar to that in the 1RNH structure. The present 
simulation, however, suggests that another plaus- 
ible cause for the observed pK a value of His 124 may 
be its interaction with Glul31 via a water molecule 
(Hisl24(0)-Bul3586-Glul31(O e2 ); not present in the 
X-ray structures) rather than its proximity to the 
active site acidic residues. 




Figure 10. Conformations of the active site residues and 
Hisl24 in the 2RN2 (light grey), 1RNH (dark grey) and 
dynamics-average structures (black). 
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Relevance to biological function 

Experimental studies on RNase Hi have identified 
several regions that are important for substrate 
binding and/or catalytic reaction (see Introduction), 
The regions implicated are: (1) the active site, (2) the 
P1-P2 turn, (3) the basic protrusion, consisting of otC 
(residues 82 to 88) and the handle region (residues 
89 to 99) and (4) the p5-ocE loop. The possible 
connection between the dynamical properties of the 
above regions and biological function is discussed 
below, 

The active site 

In both proposed models for the catalytic 
mechanism of the enzyme, a water molecule is 
activated by either a divalent cation or a carboxylate 
(general base) for nucleophilic attack of the P-O 3 
bond; in a variant of the relay mechanism, Hisl24 
has been thought to enhance the catalytic rate by 
directly or indirectly (through a water molecule) 
interacting with the general base (Oda er al., 1993a). 
A number of water molecules form stable hydrogen 
bonds with the four completely conserved acidic 
residues (see Figure 9). In some instances, these 
water molecules bridge two or more acidic residues 
together, thus shielding the repulsion of their 
negatively charged side-chains; Gfu48(O e2 )-Watl66- 
Asp70(O 51 ), Aspl0(O 52 )-Bulll84-Aspl34(O 51 ), and 
Glu48(O e2 )-Bul3158-Asp70(O 62 ). Asp70 is also 
linked to the highly conserved Asnl30 via a water 
bridge: Asp70(O 61 )-Watl75-Asnl30(N 52 ). The latter 
water molecule (Watl75) further connects Asp70 to 
the neighbour of the catalytically important AsplO, 
i.e. Asp70(O 51 )-Watl75-Thr9(O). In addition, two 
crystallographic water molecules are connected by a 
stable hydrogen bond: Watl66-Wat211 (Figure 9). 
These interactions are stable throughout the 
simulation (Table 2). 

One or more of the structural water molecules 
identified above may participate in the enzymatic 
reaction. This is supported by the finding that 
replacement of Asp 134 with residues other than 
Asn or His decreased activity significantly without 
serious effects on binding (Haruki era/., 1994). Since 
the D134N and D134H mutant proteins have a 
carbonyl O and an imino N at the 5 position similar 
in volume and electronegativity to that of the 
carboxyl O of Aspl34, it is likely that D134N(0 81 ) 
and D134H(N 51 ) can retain the hydrogen bonding 
bridge with Bulll84-Aspl0(O 6 ). Hence, the 
Aspl34(O 51 )-BuI1184-Aspl0(O 52 ) interaction may 
be required to hold the water molecule and /or 
AsplO at their appropriate positions in the active 
site for RNA hydrolysis. Furthermore, the low 
computed and observed B-factors, as well as the 
high S 2 MD values of AsplO, Glu48, Asp70 and 
Asp 134, suggest that the precise alignment of these 
catalytically important residues may be crucial for 
the enzymatic reaction (Lim & Tole, 1 992) . 
Interestingly, the loss of activity of the isolated 
HIV-1 RNase H domain correlates with relatively 



low backbone S Z NMR values (<0.7) of the two residues 
corresponding to positions 10 and 48 in the E. coli 
enzyme (the S 2 NMR values of the other two acidic 
residues corresponding to positions 70 and 134 have 
not been quantified; Powers et al., 1992). 

P 1-$2 turn 

Residues 14 to 17, in particular Asnl6, have been 
implicated in substrate binding by NMR measure- 
ments (Nakamura et ai., 1991) as well as theoretical 
models (Yang er ai., 1990; Katayanagi et al, 1992). 
Furthermore, the residue preceding the pi~p2 turn, 
Cysl3, has also been implicated in binding as the 
introduction of a disulfide bond between Cysl3 and 
Asn44 -» Cys, and thereby the additional rigidifica- 
tion of the local conformation, nearly abolished 
enzymatic activity Both computed and NMR S 2 
values of residues Cysl3-Asnl6 indicate substantial 
mobility The flexibility of Cysl3-Asnl6 probably 
facilitates the occurrence of conformational changes 
upon substrate binding. This is supported by the 
observation that rigid body docking of a RNA-DNA 
decamer onto RNase Hi resulted in a few clashes 
involving the P1-P2 turn, which can hence be 
relieved by conformational adjustment (Yang et al., 
1990). The loss of activity associated with the 
artificial 13-44 disulfide bond correlates with the 
lack of hydrogen bonding propensity of Cysl3(S T ): 
although the Cysl3(S Y )-Glyl5(0) and Cysl3(S Y )- 
Glyl8(0) distances are ^3.5 A both in the 2RN2 
and MD-average structures, Cysl3(H Y )-Glyl5(0) 
and Cysl3(H Y )-Glyl8(0) are greater than 2.5 A (see 
Results for the definition of H-bond) in the 
dynamics-average structure and the 2RN2 structure 
with built-in hydrogens, implying the absence of 
hydrogen bonding interactions. 

The basic protrusion 

The role of residues 81 to 99 in binding 
DNA/RNA-hybrid and catalysis has been 
suggested in a model of E. coli RNase Hi complexed 
with a 21-mer DNA/RNA hybrid (Katayanagi er al., 
1992) and alanine-scanning mutagenesis of the basic 
amino acid residues (Kanaya er ai, 1991a). The 
absence of this lysine and tryptophan-rich region in 
the isolated HIV-1 RNase H domain has been 
correlated with the lack of productive substrate 
binding, hence its inactivity (Kanaya et al., 1991a; 
Stahl et al., 1994). However, incorporation of the 
basic protrusion into the equivalent position of the 
isolated HIV-1 RNase H domain restored activity 
(Stahl et al., 1994), providing further evidence for 
the substrate-binding role of this region. 

The dynamic properties of ocC are obscure: 
unusually high B-factors in the simulation and the 
2RN2 structure, but not in 1RNH, and S 2 MD and S 2 NMR 
values representative of putative secondary struc- 
ture. In contrast, the handle region (residues 89 to 
99) exhibits the lowest average E-factor among all 
non-regular secondary structure elements in both 
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1RNH and 2RN2 structures, and correspondingly 
the second highest average S Z NMR and S 2 MD values. 
Only Lys96 has been assigned S 2 NMR values for both 
a slow and a fast timescale (Mandel erai., 1995). On 
the other hand, chemical exchange terms have been 
identified for five residues in the basic protrusion, 
indicating substantial mobility on the |is-ms 
timescale. Site-specific mutations of each of the 
basic residues to alanine yielded K m values that 
were three to fivefold higher than that of the 
wild-type, except for Lys86; similar replacement of 
either Trp85 or 90 also resulted in greatly enhanced 
K m but reduced V max values, suggesting that 
conformational changes possibly involving disrup- 
tion of the hydrogen bond connecting positions 85 
and 90 (see above) affect substrate binding affinity 
and catalytic efficiency These findings suggest that 
the concentrated positive charge of the basic 
protrusion and its conformation, rather than the 
ps-ns mobility of the handle region, may be an 
important determinant for binding (Kanaya er al, 
1991). 

p5-aE loop 

Residues 121 to 126 have been implicated in 
binding substrate from model structures of the 
enzyme-substrate complex (Yang et al. t 1990; 
Katayanagi erai., 1992); in addition, Hisl24 has been 
suggested to participate in the catalytic reaction (see 
Introduction). In the crystal structure of the inactive 
HIV-1 BH10 RNase H domain (Davies et al., 1991), 
this loop is disordered. In contrast, in the X-ray 
structure of a catalytically active IMAC-RNase H 
domain (Chattopadhyay er ai., 1993), the loop is 
ordered, but it exists in more than one conformation. 
This finding is consistent with the simulation: the 
P5-aE loop undergoes a sharp transition, resulting 
in two distinct conformers (see Figures 7 and 8). 
Furthermore, the observed pK a value of His 124 
appears to be consistent with the formation of the 
Hisl24(0)-Bul3586-Glul31(O e2 ) interaction during 
the simulation; in the latter conformation, Hisl24 is 
displaced several A from the active site carboxy- 
lates, as found in the 2RN2 and IMAC structures 
(Figure 9). This conformation is not in apparent 
contradiction to the suggested role of Hisl24 in 
regulating catalysis, since the flexibility of the p5-oeE 
loop may help to position the conserved Hisl24 
adjacent to the cluster of carboxylate side-chains 
upon substrate binding so that it can still take part 
in the enzymatic reaction. 

Concluding Discussion 

The present simulation of E. coli RNase Hi in 
solution yields results in overall accord with 
experimental results: agreement between the com- 
puted and X-ray B-factors is qualitative, whereas 
agreement between theoretical and experimental 
order parameters is for many residues as good as 
quantitative. Plausible explanations for the observed 
discrepancy between theory and experiment have 



been presented. The overestimation of the X-ray 
B-factors relative to the computed values has been 
deduced to be largely due to under-correction of the 
lattice disorder and reflections for radiation damage 
as well as the presence of phonon vibrations in the 
crystal, all of which resulted in enhanced X-ray 
B-factors. On the other hand, the overestimation of 
the S 2 MD values of most residues relative to the 
corresponding S 2 NMR values for differences ^0.05 has 
been attributed to the neglect of zero-point N-H 
vibrational motion. Differences between S 2 MD and 
Snmj? greater than (±0.05), found for residues in the 
P1-P2 and P2-P3 turns, the coiled-coil interface 
between helices A and D, and the p5-ocE loop, are 
probably due to inadequacies in the simulation (e.g. 
limited sampling of the present trajectory or 
inadequate solvation) and/or the experimental 
fitting protocol. 

Order parameters for several highly conserved 
residues around the active site, whose S 2 NMR could 
not be derived due to exchange broadening or 
resonance overlap, have been computed; these 
include the catalytically important residues, Asn45, 
Glu48, Asp70 and Hisl24. Although high B-factors 
have generally been correlated with low order 
parameters, there is no such correlation between the 
relatively high B-factors, yet high order parameters 
of otC were found both experimentally and in the 
simulation. This difference illustrates the variance 
in dynamical information that B-factors and order 
parameters can provide. B-factors are directly 
related to atomic rms fluctuations, whereas N-H 
order parameters are only sensitive to rotational 
motions and the local environment of the N-H 
bond, and may therefore reflect more localized 
motions as compared to B-factors. 

The physical basis of some interesting and 
unusual patterns of the observed dynamics has 
been elucidated. The alternating motional be- 
haviour of residues 116 to 120 in P5 and residues 32 
to 36 in P3 correlates with the hydrogen bonding 
patterns of the backbone nitrogen atoms. The 
NMR-observed rigidity of the Trp90 backbone 
(Mandel et al., 1995) is found to be due to hydrogen 
bonds made by Trp90(H N ) with Trp85(0) and the 
preceding atom Gly89(0) with Lys91(H N ), rather 
than the Trp90(O)-Val98(H N ) hydrogen bond in the 
crystal structure (Katayanagi etaL, 1992), which was 
not maintained during the simulation. The relatively 
significant x/ nt values found experimentally for 
Glyl23 and Alal25 (Mandel er a/., 1995) have 
been correlated with a concerted transition in the 
scalar products of the Glyl23 and Alal25 N-H 
bonds relative to their initial orientation at about 
80 ps into the production trajectory (Figure 7). An 
alternative basis for the measured pK a value of 
His 124 (7.1) is proposed: the interaction of 
His 124(0) via a water bridge to Glul31(O cZ ) seems 
consistent with its pK a value being slightly higher 
than the normal piC a value (6.5) of an imidazole 
group but lower than that of a histidine (»8) whose 
side-chain is interacting directly with an acidic 
group. 
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The simulation has also provided key insights 
into the possible roles of several residues in the 
biological function of RNase H. A number of water 
molecules formed stable hydrogen bonds with the 
four absolutely conserved acidic residues in the 
active site, and one or more of these structural water 
molecules may participate in the enzymatic 
reaction. In particular, the Aspl34(O SI )-Bulll84- 
AsplO(0 52 ) interaction may be required to hold the 
water molecule and/or Asp 10 at their appropriate 
positions in the active site for RNA hydrolysis. The 
high Smd values and low B-factors of the active site 
acidic residues imply that precise positioning of 
these residues may be crucial for the enzymatic 
reaction. On the other hand, the relatively low Smd 
values and high J3-factors of residues known to 
participate in substrate binding, Cysl3, Glyl5 and 
Asnl6 in the (3i— P2 turn, and residues 123 to 126 in 
the Ps-oce loop, suggest an induced-fit mechanism 
for binding. In addition, the substantial flexibility of 
ocC may play a role in positioning the unusually 
rigid handle region during substrate binding. 
Interestingly, the P5-aE loop is found to exist in two 
conformations, consistent with the IMAC-RNase H 
domain finding that this loop exhibits "discrete 
disorder". It appears that the flexibility of the p5-otE 
loop not only facilitates substrate binding, but also 
helps position the catalytically important His 124 
from a conformation distant from the active site 
prior to binding to an optimal one near the 
carboxylate side-chains once the substrate is bound. 

Methods 

Simulation procedure 

A molecular dynamics simulation, using the program 
CHARMM (Brooks et al. t 1983), was carried out on fully 
solvated E. coli RNase H] consisting of 2460 protein atoms 
and 11,484 TIP3P Qorgensen et ai, 1983) water atoms, 
starting from the X-ray structure of Katayanagi et al. (1992 
PDB entry 2RN2). The simulation was carried out at 
pH 5.5, the pH at which the NMR experiments were 
performed (Mandel et ai, 1995). The measured pK a values 
of all carboxyl groups (Oda et al,, 1994) and histidines 
(Oda et ai, 1993a) indicate that at pH 5.5, the aspartic and 
glutamic acids are deprotonated except for Asp 10 
(pK a = 6.1) and the histidines are protonated except for 
Hisl 14 (pK a < 5.0) . The pK a values of the other ionizable 
groups in RNase Hi have not been measured and are 
assumed to be normal: thus, all lysines and arginines are 
positively charged. 

The forces on the atoms and their dynamics were 
calculated with the program CHARMM (Brooks et al., 
1983; version 23) and an all-hydrogen parameter set 
{MacKerell et al, 1992). All protein and solvent atoms 
were propagated according to Newton's equations using 
a leap-frog integrator and a timestep of 1 fs. The water 
molecules were subjected to a deformable mean-field 
potential (Berkowitz & McCammon, 1982); this accounts 
for interactions with the infinite bulk waters that were not 
treated explicitly in the simulation. Electrostatic inter- 
actions were truncated by applying a force-switching 
function (Steinbach & Brooks, 1994) in the region 
between 1 1 and 15 A, whereas van der Waals* interactions 
were truncated at 15 A by a shifted potential (Brooks 



et al, 1983), The non-bond list was updated every 10 fs at 
a 16 A cutoff. 

Initially the 2RN2 structure was subjected to 100 steps 
of steepest descent and 500 steps of adopted-basis 
Newton-Raphson minimization in the presence of strong 
(10 kcal/A 2 ) harmonic constraints on all heavy atoms; this 
relieved close contacts in the protein without disrupting 
its overall conformation. At this stage, the rmsd from the 
crystal coordinates was 0.072 A for protein heavy atoms 
and 0.040 A for backbone atoms. The protein was then 
immersed in a 33.3 A radius sphere of TIP3P water 
molecules Qorgensen et ai, 1983) constructed from an 
equilibrated configuration of T1P4P waters at a pressure 
of 1 atmosphere and a density of 0.0334 molecules/ A 3 . 
Overlaid water molecules whose oxygens and hydrogens 
were within 2.8 A and 2.0 A, respectively, of protein 
heavy atoms or crystal water oxygens were deleted; the 
resulting system consisted of 2460 protein atoms, 225 
crystal waters and 3603 overlaid bulk water molecules 
yielding a total of 13,944 atoms. The solvated system was 
then subjected to 350 steps of steepest descent 
minimization, during which harmonic restraints on the 
protein heavy atoms were gradually reduced to zero; 
the rmsd from the 2RN2 structure at this point was 
0.203 A for protein heavy atoms and 0.120 A for backbone 
atoms. 

The solvated system was then prepared for subsequent 
dynamics by an initial 6 ps phase of heating, during 
which initial velocities were randomly assigned from a 
Maxwell-Boltzmann distribution at 25 K and re-scaled at 
temperatures increasing from 25 to 300 K (for 6 ps). This 
was followed by a 24 ps equilibration stage, during which 
velocities were re-scaled to 300 K at 2 ps intervals if the 
average temperature fell outside a 10 K window (for 
14 ps) and a 5 K window (for 10 ps). A total of 3.2 ps of 
production dynamics were completed per day on an IBM 
RS/6000 590 workstation. Co-ordinates were saved every 
100 fs resulting in 4000 frames for analysis. The results 
described in the text are based on a 400 ps trajectory 
(characterized by an average temperature of 296 K), 
which was used to compute B-factors (Brooks et al., 1988), 
internal correlation functions and order parameters (Levy 
et al., 1981, 1982; Philippopoulos & Lim, 1994). The 
definitions and equations used to compute these 
quantities are summarized in the next section. 



Analysis 

The B-factors, B t , were computed using the relation: 

B, = ^<A/f> (2) 

where An is the atomic displacement for atom i calculated 
from the trajectory 

Qnt(t) is the correlation function describing internal 
motion and is given by, 

<^W-<r*Wr(%$f) (3) 

with %, the angle between the internuclear vectors r(t) 
and r(0), and r measured in a molecule-fixed frame. The 
internal correlation function can be redefined in terms of 
the order parameter, S 2 , and the internal correlation time, 
T/ m . This is possible as the internal correlation function of 
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a Markovian motional process can be written as a series 
of exponentials (Lipari & Szabo, 1982): 



The jackknife procedure differs from the commonly 
employed blocking technique (Allen & Tildesley, 1990): 



Qnt(t) = I a,e-' A < 



(4) 



A simple approximation to equation (4) that renders C/ m (t) 
exact at r=0 and t= oo is (Lipari & Szabo, 1982); 



C im (t) = S 2 +(l-S 2 )e-^ 



where 



S 2 = Urn C ( „,(t) = f <r 6 >"' £ l^gj^ 



and 



1 

C, nr (0)-S 2 



(C /nf (r) - S 2 )dt 



(5) 



(6) 



(7) 
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<X/» 2 



(10) 



where x, is the mean value of % for each block and <x> 
the mean of x/ over the k blocks. Although both methods 
reduce the correlations among neighbouring data sets, the 
jackknife procedure has the advantage of reducing the 
bias towards data sets showing greater amplitude of 
fluctuation, since X/ is computed from all the blocks in the 
jackknife procedure rather than just block i as in the 
standard blocking technique. For sufficiently large k t the 
jackknife procedure has a further advantage of reducing, 
in general, the estimated error, which is a function of Ic z 
(equation (9)), whereas the uncertainty computed from 
the standard blocking method (equation (10)) is inversely 
proportional to k. 



In equation (6), Yz m (Q, <(>) are the second-order spherical 
harmonics as functions of the angular spherical coordi- 
nates of the internuclear vector in a molecule-fixed frame. 
In equation (7), Tis the time after which C in f(t) = S 2 , and 
C /m (0) is the value of the correlation function at the first 
coordinate frame of the trajectory 



The jackknife method 

The uncertainties in the order parameters and internal 
correlation times were computed using the jackknife 
procedure (Quenouille, 1956), which was introduced as a 
means of eliminating bias in estimating parameters. If 
quantity % is computed from N data sets (e.g. coordinate 
frames), we wish to determine Ax. The algorithm is as 
follows: 

1. Divide the N sets into k blocks (preferably 10 or 
more). 

2. Compute x-/. the value x from the (k- 1) blocks 
discounting block i. 

3. Calculate x* from each of the k %-> values according 

to 



X ! = k x -(k-\)x-, 



(8) 



Note that x is the weighted mean of x* and x-f- 
4. Ax is then given as the standard error in the mean 



Ax = 



X (xf-<xr» 2 



(9) 



where <x*> denotes the mean of x* calculated over the k 
blocks. 

For the calculation of the uncertainties of order 
parameters and correlation times by this procedure, the 
size of each of the k blocks must be chosen to be no 
smaller than the correlation time of the motional process 
of interest; the purpose of this choice is to ensure that the 
X* values are statistically independent from one another. 
In the present work, we chose k= 10 or a block size of 
40 ps (overall length of the analysis trajectory is 400 ps), 
since most residues have i/ m values <40 ps. 
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